Eur. Phys. J. C manuscript No. 

(will be inserted by the editor) 



Top quark mass measurements at and above threshold at CLIC 

Katja SeideP, Frank SimoipEl Michal TesafS, Stephane PosP 

1 Max-Planck- Institut fiir Phy sik, Fohringer Ring 6, 80805 Munich, Germany 
2 CERN, 1211 Geneva, Switzerland 



o 

(N 



in 



x 

i 



> 
00 

in 

cn 

cn 
o 



X 



Received: date / Accepted: date 

Abstract We present a study of the expected precision of 
the top quark mass determination, measured at a linear e + <?~ 
collider based on CLIC technology. GEANT4-based detec- 
tor simulation and full event reconstruction including real- 
istic physics and beam-induced background levels are used. 
Two different techniques to measure the top mass are stud- 
ied: The direct reconstruction of the invariant mass of the top 
quark decay products and the measurement of the mass to- 
gether with the strong coupling constant in a threshold scan. 
For both techniques, total uncertainties of around 100 MeV 
or better are achieved, including a first study of expected 
systematic uncertainties. For the threshold scan, the preci- 
sion at ILC is also studied to provide a comparison of the 
two linear collider technologies. 

1 Introduction 

As the heaviest Standard Model particle, the top quark is 
of particular interest since it couples most strongly to the 
Higgs field and may provide sensitivity to Beyond the Stan- 
dard Model physics. Its mass is a key parameter provid- 
ing, together with the Higgs mass, constraints on the sta- 
bility limit of the Standard Model [ 1 ]. Currently, the uncer- 
tainty of the top quark mass is the leading uncertainty in 
this evaluation [2|. At the Tevatron, the top mass has now 
been determined with a combined precision of 1 GeV to 
173.18 ± 0.56 (stat.)± 0.75 (syst.) GeV 0, and the com- 
bined LHC results at 7 TeV reach a total error of 1.4 GeV 
with 173.3 ±0.5 (stat.) ±1.3 (syst.) GeV ffl. In both cases, 
the largest uncertainties are systematic, which are expected 
to improve only slowly as the analyses progress. 

Experimentally, the top quark offers the unique possibil- 
ity to study an almost free quark, since its lifetime is con- 
siderably shorter than the QCD hadronization time. Due to 
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its large mass, it is also the only quark so far not studied 
in electron-positron annihilation. Experiments at e + e~ col- 
liders offer the possibility for a wide variety of studies in- 
volving top quarks, ranging from the precise measurement 
of the top quark mass and width to the investigation of asym- 
metries providing large sensitivity to various New Physics 
models 0. 

At such a collider, the mass of the top quark can be 
determined with two different techniques: through the di- 
rect reconstruction of top quarks from their decay products 
at energies above the production threshold, and through a 
scan of the top-pair production threshold. While the invari- 
ant mass is an experimentally well-defined observable, its 
interpretation in terms of the top quark mass in a theoret- 
ically clean definition suffers from potentially substantial 
uncertainties due to non-perturbative contributions. Progress 
has been made recently in establishing connections between 
the top mass parameters used in theory and experimentally 
accessible parameters E1I7), but theoretical uncertainties re- 
main on the level of several 100 MeV. The top quark pair 
production cross-section near threshold, on the other hand, 
can be calculated with a high degree of precision using the- 
oretically well-defined top mass definitions, such as the IS 
mass of Hoang et al. [8 |, which then can be transformed into 
other commonly used mass schemes. A scan of the tt thresh- 
old thus provides the possibility for a precise mass measure- 
ment with well-controlled theory uncertainties. 

In this paper, we study the capabilities of a linear e + e~ 
collider based on CLIC technology |9| for precise measure- 
ments of the top quark mass using both experimental ap- 
proaches. The studies are performed using GEANT4 ifTUl 
- based simulations of signal and relevant background pro- 
cesses including machine-related backgrounds in a detailed 
detector model with realistic modeling of the detector re- 
sponse and with event reconstruction. In Section |2j the ex- 
perimental conditions and the detector concepts at CLIC are 
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outlined, followed by a summary of the event generation, 
simulation and reconstruction in Section[3]and a description 
of the techniques used for the identification and reconstruc- 
tion of top quark events in Section [4] The results for the 
top mass obtained with direct reconstruction of the invari- 
ant mass at ^/s = 500 GeV are given in Section [5] while the 
threshold scan at energies around 350 GeV is described in 
Section [6] The latter also includes a study of the effects of 
the CLIC luminosity spectrum on the threshold scan com- 
pared to that of the ILC, which has a somewhat more peaked 
spectrum, by repeating this analysis using the ILC spectrum. 

2 Experimental conditions and detectors at CLIC 

The Compact Linear Collider CLIC is a collider concept 
based on normal conducting accelerating cavities and two- 
beam acceleration f9|- It is designed to provide collision en- 
ergies up to 3 TeV. The project is foreseen to be implemented 
in several stages ifTTI to provide optimal luminosity condi- 
tions at lower energies. Here, we consider a possible first 
stage of CLIC with a maximum design energy of 500 GeV, 
to be operated both at the tt threshold around 350 GeV and 
at 500 GeV. 

The experimental conditions at CLIC are mainly influ- 
enced by three factors: the luminosity spectrum, background 
from two-photon processes and the bunch crossing frequen- 
cy of 2 GHz. The small beam spot size required to achieve 
a luminosity of 2.3 x 10 34 cm~ 2 s~' leads to substantial en- 
ergy losses due to beamstrahlung, resulting in a luminosity 
spectrum with 61% of the luminosity in the top 1% of the 
energy at 500 GeV. When operated at 350 GeV, the lumi- 
nosity is 1.1 x 10 34 cm~ 2 s~', with 77% of that in the top 
1%. Radiative effects, together with the high beam energy, 
result in substantial cross sections for two-photon processes. 
Of particular relevance are 77 — > hadrons processes, which 
deposit additional energy in the full detector acceptance. At 
500 GeV, on average 0.3 such events are expected per bunch 
crossing, with an energy of 13.3 GeV, 3.4 GeV of which is 
deposited in the calorimeter system. Together with the short 
bunch-to-bunch spacing of 0.5 ns, this leads to a pile-up of 
background from many bunch crossings over the 177 ns long 
bunch trains for typical detector integration times of the or- 
der of a few to several tens of nanoseconds. At an energy 
of 350 GeV, the rate of 77 — > hadrons is reduced to 0.046 
events per bunch crossing. 

The detector concepts for CLIC lfT2l are based on the 
two ILC concepts ILD [13) and SiD O, since the per- 
formance requirements for ILC and CLIC are similar. For 
the CLIC detectors, design modifications motivated by the 
higher energy of up to 3 TeV and by the more challeng- 
ing experimental conditions such as the 0.5 ns bunch spac- 
ing and the higher rate of incoherent pairs are implemented. 
The detector concepts provide highly efficient tracking with 



excellent momentum resolution in a solenoidal field, pre- 
cise secondary vertex reconstruction and highly segmented 
calorimeters optimized for jet reconstruction using Particle 
Flow Algorithms. To provide the depth of the calorimeters 
necessary for multi-TeV operation, the barrel hadronic calo- 
rimeter uses tungsten absorbers. The inner radius of the ver- 
tex detectors is increased compared to the ILC to account 
for the larger beam crossing angle and the higher rate of 
incoherent e + e~ pairs produced in the collision. While the 
calorimeters are expected not to change for the different en- 
ergy stages of CLIC, the vertex detector used at 500 GeV 
and below can be closer to the interaction point than the one 
used at 3 TeV. 

For the present analyses, the 500 GeV version of the 
CLIC_ILD [12 1 detector is used. This detector concept uses 
a low-mass pixel vertex detector with an innermost radius 
of 25 mm, intermediate silicon strip tracking, a TPC as main 
tracker, a silicon-tungsten electromagnetic calorimeter and a 
hadron calorimeter system with scintillator tiles read out by 
silicon photomultipliers with tungsten absorbers in the bar- 
rel and steel absorbers in the endcap region. The full detec- 
tor system is located within a 4 T solenoid, with additional 
muon tracking in the instrumented return yoke. A detailed 
model of the detector has been implemented in GEANT4, 
and the full reconstruction software including particle flow 
reconstruction has been used in the present analyses. 

3 Event generation, simulation and reconstruction 

The signal process studied here is top quark pair production, 
e + e~ — > tt, which, at a 500 GeV CLIC collider, has a cross 
section of 530 fb. The top quark decays almost exclusively 
into a W boson and a b quark. The signal events can thus be 
grouped into different classes, according to the decay of the 
W bosons. These are 

- the fully -hadronic channel e + <?~ — ^ tt — > qqbqqb, with 
both Ws decaying into quark pairs with a branching ratio 
of 46% 

- the semi-leptonic channel e + e~ — > tt —> qqblvb , with 
one W decaying into quarks, the other into a lepton and 
a neutrino with a branching ratio of 15% for each of the 
three lepton flavors 

- the fully -leptonic channel e + e~ — > tt — > Ivblvb, with 
both Ws decaying into a lepton and a neutrino, with a 
branching ratio of 9%. 

In the leptonic channels, the decay into a T and a neutrino is 
a special case, since the z decays almost instantly into either 
a lepton and two neutrinos or into one or more hadrons and a 
neutrino, giving rise to additional missing energy in the final 
state, and potential confusion with hadronic decay modes. 

In the analyses discussed below, only fully-hadronic and 
semi-leptonic events, excluding T final states, were selected, 
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type 


final 


a 


a 




state 


500 GeV 


352 GeV 



Signal (m top = 174 GeV) 




530 fb 


450 fb 


Background 


WW 


7.1 pb 


11.5 pb 


Background 


zz 


410fb 


865 fb 


Background 


qq 


2.6 pb 


25.2 pb 


Background 


WWZ 


40 fb 


10 fb 



Table 1 Signal and considered physics background processes, with 
their approximate cross section calculated for CLIC at 500 GeV and 
at 352 GeV. 



since these provide the best mass resolution and the clearest 
identification of top quark events. This corresponds to 76% 
of all top quark pair decays. However, to account for imper- 
fect event classification, all possible decay modes of the it 
pair were generated according to their branching fractions 
and included in the signal sample. The top mass and width 
were fixed for the signal event sample generation to m to p = 
174.0 GeV and a t o P = 1-37 GeV. 

In addition to the signal, background processes with sim- 
ilar event topologies have to be considered. These are mostly 
four and six fermion final states, with the high cross section 
quark pair production in addition. Table [T] lists the studied 
processes, with cross sections at a CLIC machine operated 
at 500 GeV and 352 GeV. The background samples were 
generated for an integrated luminosity of 100 fb , with ad- 
ditional events produced for the signal process to provide 
high-statistics data sets for signal-only studies to determine 
the detector resolution function. In addition, the processes 
e + e~ — > qqe + e~ and e + e~ — > qqev, which are dominated by 
t-channel single boson production, were investigated using 
samples with reduced statistics. It was shown that the non- 
di-boson contributions are rejected completely in the analy- 
sis. Since the di-boson contributions are accounted for in the 
e + e~ — > WW and e + e~ — > ZZ modes, these additional final 
states were not considered separately. 

In addition to the main production of signal and back- 
ground processes, a signal sample with a different top quark 
mass and width (m top = 175 GeV, a top = 1.5 GeV) was gen- 
erated to study possible systematic effects in the results de- 
pending on the input parameters used to derive the detector 
resolution function. 

PYTHIA ITT31 was used to generate the signal process 
e + e~ — > tt as well as the two background processes e + e~ — > 
WW and e + e~ — > ZZ, since this generator correctly accounts 
for a non-zero width of explicitly defined intermediate and 
final states. The processes e + e~ — > qq, e + e~ — > qqe + e~ and 
e + e~ — > qqev were generated with WHIZARD ifTBI . which 
was used as the default event generator for the CLIC Con- 
ceptual Design Report benchmark studies. Since the back- 
ground process e + e~ — > WWZ is not implemented in PY- 
THIA, WHIZARD was used for its generation. For sim- 
plicity, these events were generated with zero width for the 



bosons, which allowed to specify the tri-boson final state ex- 
plicitly prior to the decay into six fermions. 

For all generated events, the luminosity spectrum of the 
500 GeV CLIC machine was considered, and beam-induced 
background in the form of yy — > hadrons events was added. 
More information on the beam induced backgrounds can be 
found in lfl2l . The events were simulated with GEANT4 us- 
ing the CLIC_ILD detector model defined in Mokka fPTl . 
300 bunch crossings of yy — > hadrons events were overlaid 
with the signal event at the digitization stage. For 500 GeV, a 
Poissonian probability with a mean of 0.3 events per bunch 
crossing is used. At 350 GeV, the mean reduces to 0.046. 
The tracking and particle flow event reconstruction [18] was 
then performed on the combined event comprising signal 
and beam-induced background. Mitigating the effect of this 
background requires precise time stamping to assign energy 
deposits to individual bunch crossings, and cuts based on 
the transverse momentum of particles, since the background 
particles are predominantly at low pj in the forward and 
backward regions of the detector. The applied cuts are doc- 
umented in |[T2l[T8l . The data analysis is then performed on 
reconstructed particles (Particle Flow Objects - PFOs) after 
the application of these cuts. 

4 Top quark reconstruction at CLIC 

The identification of top quark pair production events and 
the rejection of non-ff background proceeds in several steps. 
The analysis strategy outlined in the following is used both 
above and at threshold. The quality cuts applied differ for 
the two mass measurement methods, since for the direct re- 
construction of the mass the quality of the selected events is 
of key importance, while for the cross section measurement 
in a threshold scan the emphasis is on maximising the signal 
significance. 

As a first step, all events are classified according to the 
number of isolated leptons (electrons and muons) in order 
to identify the top pair decay modes tt —> qqbqqb and tt — > 
qqblvb which are used further in the analysis. The events 
are classified as fully-hadronic (no isolated lepton found), 
semi-leptonic (exactly one isolated lepton found) or fully- 
leptonic (at least two isolated leptons found) top pair candi- 
dates. Events classified as fully-leptonic are rejected, while 
the other two classes are clustered into four or six jets, ac- 
cording to event class. Following this, a flavor tagging algo- 
rithm is used to identify the two jets most likely originating 
from b quarks. For the fully-hadronic channel the correct 
combination of the four non-Z? jets into W bosons has to be 
found among the three possible combinations, while for the 
semi-leptonic case the assignment of light jets and leptons 
to W candidates is unique. The pairing of W candidates and 
b jets into the two top candidates is done with a kinematic fit 
exploiting energy and momentum constraints to improve the 
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top mass measurement. This step also provides strong rejec- 
tion of non-ff background since such events tend to lead to 
a failure of the fit by not satisfying the imposed constraints. 
Additional background rejection is achieved with a binned 
likelihood technique, after which a highly pure sample of 
top quark pair events is available for the reconstruction of 
the invariant mass or the measurement of the cross section. 
In the following, each of these steps is briefly described. 

4. 1 Lepton finding 

The classification into fully-hadronic, semi-leptonic and ful- 
ly-leptonic events is based on the identification of isolated 
leptons using a lepton finder as a first step of the analysis. 
It is optimized to identify charged leptons (e or fi ± ) origi- 
nating from the decay of W bosons. Since these leptons are 
typically highly energetic, and, in contrast to leptons orig- 
inating from hadronic decays in quark jets, well separated 
from other activity in the event, isolation and energy are 
used as selection criteria. A minimum lepton energy of 10 
GeV is required. If there is no other charged particle with an 
energy larger than 2.5 GeV within a cone with an opening 
angle of 10° around the lepton track, it is considered to be 
isolated. The cut values were optimized in a parameter scan 
with tt events using generator level simulation information. 

The residual yy — > hadrons background leads to a slight 
deterioration of the quality of the event classification, re- 
ducing efficiencies by 1% to 2% compared to the case with- 
out additional background. Overall, 96% of the all-hadronic 
events and 91% of the semi-leptonic events (ignoring final 
states with T leptons) are correctly classified. At this stage, 
56% of all input events are classified as fully-hadronic and 
33% as semi-leptonic candidates. The remaining 11% are 
rejected. Out of the accepted events, in particular the fully- 
hadronic, but also the semi-leptonic samples receive contri- 
butions from events with T final states which are reduced in 
further steps of the analysis. 

4.2 Jet clustering 

Jet clustering of the events is performed in exclusive mode, 
meaning that events are clustered into a fixed number of jets. 
According to the classification of the lepton finder, events 
are clustered either into 6 jets (fully-hadronic) or into 4 jets 
(semi-leptonic). In the latter case, the identified isolated lep- 
ton is excluded from jet finding. 

Jet finding is performed with the k t algorithm lfT9l im- 
plemented in the FastJet package Il20l |2T1 . This algorithm 
uses the pseudorapidity tj and the azimuthal angle as the 
basis to define the two-particle distance. This metric results 
in an increase of the two-particle distance in the forward re- 
gion and makes the algorithm robust against yy — > hadrons 



background since these mainly forward-going particles tend 
to be clustered into the beam jets. These are implemented in 
the algorithm to account for beam remnants in the case of 
hadron collisions and are not considered further in the anal- 
ysis. In the present study, the k t jet algorithm is used with 
a jet size parameter R of 1.3, which is selected as the best 
trade-off between the requirements to limit the loss of sig- 
nal particles on the one hand and to limit the inclusion of 
background on the other. 

4.3 Flavor tagging 

Efficient ^-tagging is essential for the identification of tt — >■ 
(bqq)(bqq) and tt — > (bqq)(blVi) events compared to multi- 
fermion background, and is also crucial for the correct as- 
signment of jets to top candidates for signal events. In the 
analysis, flavor tagging is performed using the LCFI Flavour 
Tagging lt22l package. This algorithm is based on a neural 
network which provides b and c jet probabilities ("b-tag") 
for each jet in the event, depending on a number of input 
variables such as reconstructed secondary vertices, particle 
momenta and impact parameters. The training of the neu- 
ral network used for the tagging is performed with a fully- 
hadronic sample of tt events. These events are generated 
and reconstructed without beam spectrum, initial state ra- 
diation and top width, but do contain all other generation, 
simulation and reconstruction details, such as overlaid yy — >• 
hadrons background. 

4.4 Jet pairing 

For both analysis branches (6-jet fully-hadronic and 4-jet 
semi-leptonic candidates), the two jets with the highest b- 
tag values are classified as jets created by a b quark (b jets). 
All other jets in the event are classified as light-jets (created 
by u, d, s or c quarks) originating from the W boson decay. 

In case of the 4-jet sample all decay products of the tt 
pair are found at this stage of the analysis: Two b jets, two 
light-jets forming one W boson, one charged lepton and a 
neutrino forming the other W boson. As the neutrino is un- 
measured, its momentum is taken to be the total missing 
momentum in the event. Since the neutrino absorbs all en- 
ergy reconstruction uncertainties, and since the missing en- 
ergy measurement also includes contributions from the lu- 
minosity spectrum, the leptonic W boson mass is less well 
constrained than the hadronic W boson mass. In the 6-jet 
case, the correct pairing of light-jets into W bosons has to 
be identified among the three possible permutations of light- 
jet pairs. For each permutation the invariant mass of both 
jet pairs is calculated and compared with the true W boson 
mass. The permutation with the minimal value of 

v=\mij-mw\ + \m k i-m w \, (1) 
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Fig. 1 Distribution of the reconstructed mass of the W bosons for a 
signal-only sample after the classification into fully-hadronic and semi- 
leptonic candidates before further event selection for collisions at 500 
GeV. The distributions are normalized to allow a shape comparison. 
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Fig. 2 Reconstructed top mass for signal-only events selected in the 
fully hadronic branch with and without kinematic fit for collisions at 
500 GeV. For both distributions only events with a successful kinematic 
fit are shown. 



where mw = 80.4 GeV and my and my are the invariant 
masses of two distinct jet pairs, is chosen as the correct com- 
bination. 

Figure [T] shows the normalized distribution of the recon- 
structed invariant mass for all W candidates in a signal only 
sample. For the fully-hadronic analysis branch, only the W 
candidates chosen according to Equation [1] are shown. The 
distribution for the semi-leptonic candidates has more pro- 
nounced tails due to the aforementioned additional uncer- 
tainties entering for the neutrino in the leptonically decaying 
W. 



4.5 Kinematic fitting 

After the identification of b jets and the pairing of light jets 
and leptons into W bosons, the next step of the analysis is 
the correct grouping of W boson candidates and b jets into 
top quarks. This assignment is performed using a kinematic 
fit. Out of the two possible combinations, the one with the 
higher probability of the kinematic fit result is chosen. 

The kinematic fit 11231 uses constraints based on the as- 
sumption of a tt event to improve the precision of the event 
parameters of interest. The parameters of interest are calcu- 
lated / fitted using a least squares technique and the physical 
constraints are incorporated into the fit using Lagrange mul- 
tipliers. In the present analysis, all constraints are imposed 
as hard constraints, meaning that they have to be fulfilled 
exactly. The input to the kinematic fit in case of the 6-jet 
sample are the four-momenta of the light jets, already paired 
into W bosons, and the four-momenta of the b jets. In case 
of the 4-jet sample, the input values are the four-momenta 



of the light jets and b jets as well as of the isolated lepton. In 
the latter case, the unmeasured neutrino is also represented 
in the fit, with starting values set to the measured missing en- 
ergy and momentum in the event. The constraints imposed 
are full three dimensional energy and momentum conserva- 
tion taking into account the crossing angle of the beams, the 
correct reconstruction of the W boson masses to the nomi- 
nal mass of mw = 80.4 GeV, and an equal mass of the two 
top candidates. In order to fulfill these constraints, the mea- 
surements, such as jet and lepton energies and angles, are 
allowed to vary from their measured values within the ex- 
perimental resolution. 

The kinematic fit fails if it is unable to satisfy the con- 
straints outlined above. The failures are due to several ef- 
fects such as the wrong classification into the semi-leptonic 
and fully-hadronic event branch, imperfect jet clustering, W 
boson reconstruction from the wrong jet combination, too 
large remaining yy — > hadrons background and the influence 
of the luminosity spectrum. 

It is observed that some of the fit failures are due to the 
wrong identification of one of the b jets. This is particularly 
likely in the case of a W decaying into a charm quark and 
another light quark. Thus, to improve the number of suc- 
cessful fits and to account for possible wrong flavor tagging, 
the jet pairing into W candidates and the kinematic fit are re- 
peated for unsuccessful kinematic fits after exchanging the 
b jet with the lower b-tag value and the light-jet with the 
highest b-tag value. This procedure increases the number of 
successful kinematic fits by approximately 20%. 

Overall, 36.7% of the signal events classified as fully- 
hadronic candidates and 46.5% of the signal events classi- 
fied as semi-leptonic candidates pass the kinematic fit sue- 
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cessfully at 500 GeV. This failure rate reflects the optimiza- 
tion of the analysis towards cleanly reconstructed events to 
provide the best possible mass measurement. Figure[2]illus- 
trates the influence of the kinematic fit on the reconstructed 
top quark mass by comparing the mass distributions before 
and after the kinematic fit for fully-hadronic events with suc- 
cessful fits. 

At threshold, fit success rates of more than 90% are ob- 
tained for both the fully-hadronic and the semi-leptonic anal- 
ysis branch. This higher success rate is due to more con- 
strained kinematics directly at the production threshold and 
reduced two-photon backgrounds as well as due to reduced 
requirements on the fit quality. These reduced requirements 
are possible since at threshold the highest possible signifi- 
cance and not the cleanest possible reconstruction is the goal 
of the analysis. 

By imposing requirements on the kinematic structure of 
the events, the kinematic fit also serves as a powerful tool 
for rejecting non-ff background. More than 97% of the qq 
background is rejected by the fit in both branches, and more 
than 94% of the di-boson pair production events grouped 
into the fully-hadronic analysis branch. In the semi-leptonic 
branch, ZZ events are rejected with an efficiency of 94%, 
WW events with 86%. The WWZ background, which can 
result in final states identical to those of top pair events and 
has similar kinematics overall, is only rejected at the level of 
13% for fully-hadronic and 25% for the semi-leptonic event 
candidates. 



The variables entering the likelihood are the two highest 
fo-tags, the sphericity and the overall number of particles in 
the event, the masses of the W bosons, the difference of the 
two top masses without kinematic fit, and the d cut variable 
of the jet clustering algorithm. The d cut variable provides a 
measure of how natural it is to cluster a given event into the 
defined number of jets rather than fewer jets. Since it events 
are characterized by a higher jet multiplicity than most of 
the background channels, this variable provides a good sep- 
aration of signal and background. 

For signal events passing the kinematic fit, an efficiency 
of 93.5% (94.4%) is achieved for the fully-hadronic (semi- 
leptonic) analysis branch. At the same time, 98.0% (96.7%) 
of the background events remaining after the kinematic fit 
are removed by this additional background rejection step in 
the fully-hadronic (semi-leptonic) event sample. Together 
with the kinematic fit, 99.8% (99.7%) of all background 
events are rejected. 



5 Top mass measurement above threshold 

At energies substantially above the production threshold, in 
this paper studied at 500 GeV, the top quark mass is mea- 
sured from the invariant mass of the reconstructed decay 
products. 



4.6 Additional background rejection 

Beyond the rejection of non-ff background by the kinematic 
fit, additional signal and background discrimination is pro- 
vided by a binned likelihood technique 11241 which com- 
bines several discriminating observables into one likelihood 
variable. For the two event classes j, "signal" and "back- 
ground", probability density functions f J (xi) for each dis- 
criminating variable Xj are provided as input to the likeli- 
hood algorithm. The probability p J (xi) for a given event to 
belong to event class j for a given value of the discriminat- 
ing variable x, is given by 



P>{Xi) ~ 



fK*i) 



(2) 



where k runs over all event classes. 

The final likelihood for an event belonging to the signal 
event class S, combining the probabilities of the individual 
discrimination variables, is given by 

UiP S {xi) 



HkIliP k (xi)' 



(3) 



with i running over all discrimination variables and k over 
all event classes. 



5.1 Determination of top quark mass and width 

In this analysis, the distribution of the reconstructed invari- 
ant mass for signal and background events after kinematic 
fit and background rejection is considered. Since the kine- 
matic fit imposes the constraint of equal masses of the top 
and the anti-top in the 1 1 pair, each event provides one unique 
measurement of the mass and thus enters only once in the 
distribution. 

Figure[3]shows the reconstructed top mass for signal and 
background after kinematic fitting and background rejection 
for an integrated luminosity of 100 fb _1 recorded at 500 
GeV. From this distribution, the top mass is extracted us- 
ing an unbinned likelihood fit. The fit function consists of 
three components, the first of which accounts for physics 
background, while a convolution of the other two describes 
the resolution and the signal itself. Prior to the final fit, a 
parametrization of the background distribution is obtained 
from a fit to a background-only sample and the influence 
of the detector resolution is determined with a statistically 
independent signal-only sample corresponding to twice the 
integrated luminosity considered in the analysis. 

The background is parametrized by a threshold function 



bgd — (x — a) b 



(4) 
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Fig. 3 Distribution of reconstructed top mass for events classified as fully-hadronic (left) and semi-leptonic (right). The data points include signal 
and background for an integrated luminosity of 100 fb -1 . The pure background contribution contained in the global distribution is shown by the 
green solid histogram. The top mass is determined with an unbinned likelihood fit of this distribution, which is shown by the solid line. 



in which the threshold is fixed to a = 100 GeV and b is 
left as a free parameter which is determined with a fit to 
a background-only sample. 

For the signal fit, the following function is used: 

signal = /•BreitWigner(m bw ,(7bw) ( 8> res (5) 
+{l-f)- tail (6) 

This fit consists of two main components, a signal part de- 
scribed by a Breit-Wigner convolved with a resolution func- 
tion res implemented as the sum of three Gaussians, and a 
background part labeled tail to account for the high-mass 
tail observed in the signal at masses around 230 GeV. This 
tail is due to kinematic reflections in events with incorrect 
assignments of jets and/or leptons to top candidates. The 
mean value of the Breit-Wigner is given by mt, w , and the 
corresponding width by dbw It i s found that the tail is well 
modelled by a simple Gaussian. The relative fraction of the 
two components is described by the factor /. 

The resolution component of the signal part is described 



by 

res = fl • Gauss l(x,ml,,sl) + (7) 

/2-Gauss2(x,m2,s2) + (8) 

(1-/1-/2). Gauss3(jt,m3,j3), (9) 



where the notation Gauss(x, mean, sigma) is used and fl 
and /2 are the fractions of the Gaussians in the sum. This 
function does not only represent the detector resolution, but 
also accounts for systematic effects introduced by the anal- 
ysis chain and by the pick-up of yy — > hadrons background. 
The resolution function is determined with a fit to a sepa- 
rate signal-only event sample for which the mean and the 



channel 


'"top 


4m t0 p 


Otop 




fully-hadronic 


174.049 


0.099 


1.47 


0.27 


semi-leptonic 


174.293 


0.137 


1.70 


0.40 


combined 


174.133 


0.080 


1.55 


0.22 



Table 2 Results summary for the top mass measurement at 500 GeV 
for an integrated luminosity of 100 fb -1 . All numbers are given in units 
of GeV, Errors are statistical only. 



width of the Breit-Wigner component are fixed to the gen- 
erator values of mt, w = 174 GeV and Obw = 1-37 GeV. All 
parameters of the Gaussian sum are left free in this fit. 

The full probability density function of the final fit for 
the top mass distribution containing signal and background 
is given by the sum of the signal and background functions, 

pdfm = y s i g • signal +yb g d ■ bgd, (10) 

where y s i g and ybgd describe the signal and the background 
yield, respectively. In the final fit to the overall top mass 
distribution including signal and background contributions, 
fixed values for the Gaussians of signal and bgd are used, 
leaving only m bw , Ob W , y s ig and ybgd as fi" ee parameters. The 
fit, shown by the lines in Figure [3] is performed indepen- 
dently for the fully-hadronic and for the semi-leptonic events. 

The results of the fit are summarized in Table [2] The 
determined masses and widths are in good agreement with 
the generator values, with a 2a deviation observed for the 
reconstructed mass in semi-leptonic events. Combining the 
results of both event classes, a statistical uncertainty of 80 
MeV for the top quark mass is obtained for an integrated 
luminosity of 100 fb . 
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The uncertainty on the width is considerably larger since 
the width of the reconstructed mass distribution is domi- 
nated by the resolution function, which incorporates detec- 
tor and reconstruction effects as well as the influence of the 
collider luminosity spectrum. Still, a combined uncertainty 
of 220 MeV is reached with the present fit function. 

5.2 Systematic studies 

While a full study of possible systematic uncertainties is be- 
yond the scope of the current work, two potentially sizable 
sources of systematic uncertainties have been investigated, 
the influence of the assumed mass and width in the "train- 
ing" of the fit procedure, and the jet energy scale. 

A possible bias due to the training of the fit procedure 
is studied by using a data sample with a top quark mass of 
175 GeV and a width of 1.5 GeV, compared to 174 GeV 
and 1.37 GeV used in the signal sample in the analysis, for 
the determination of the fit parameters. The results obtained 
with these parameters deviate from the original results by 
less than the statistical uncertainty for both fully-hadronic 
and semi-leptonic events. Since the statistical uncertainty of 
the detector resolution parameters themselves is comparable 
to the overall statistical uncertainties due to comparable inte- 
grated luminosities in the training and in the signal samples, 
there is no indication for a bias introduced by the a priory 
uncertainty of the top quark mass. While small biases can 
not be excluded due to the limited statistics, they are below 
the statistical uncertainties of the analysis. 

The jet energy scale is one of the most important sources 
of systematic uncertainties in top quark mass measurements 
to date. To investigate this effect on the present analysis, 
three separate studies have been performed: the energy of 
all jets was shifted by 2%, the energy of light jets only was 
shifted by 2%, and the energy of b jets only was shifted by 
2%. The kinematic fit, which imposes constraints on the re- 
constructed W mass, limits the influence of a shift in the 
light jet scale to less than 100 MeV. Correlated shifts of all 
jets result in a bias of 200 MeV, while a shift of b-jets only, 
which are not well constrained by the kinematic fit, results 
in a bias of 350 MeV. The influence of shifts in the light-jet 
energy scale can be further reduced by an in-situ calibration 
based on the reconstructed W mass before the kinematic fit. 
For an integrated luminosity of 100 fb~ 1 , the precision of the 
mass obtained with a simple fit of the invariant mass distri- 
bution is better than 100 MeV, which will allow to constrain 
the jet energy scale to better than 1%. While such a possibil- 
ity does not exist for b-jets in top quark pair production, it is 
expected that a similar precision can be reached for b-jets by 
reconstructing Z decays in di -boson production and radiative 
return events. This will bring jet energy scale systematics on 
the top mass measurement at CLIC to a level comparable to 
the statistical uncertainties of the measurement. 



6 Top mass measurement in a threshold scan 

In addition to the direct reconstruction of the top quark mass, 
a linear collider offers the possibility of a threshold scan, 
which allows to measure the top mass in a theoretically well- 
defined way, as discussed above. 

To determine the top mass in a threshold scan, a mea- 
surement of the it production cross section at several points 
around the threshold is necessary. To identify top pair events 
with high purity the same analysis procedure as for the in- 
variant mass measurement is used, albeit with relaxed cuts 
on the quality of the kinematic fit. Since the threshold behav- 
ior of the cross section also depends on the strong coupling 
constant 11251 . the top quark mass is extracted simultane- 
ously with a s . Depending on the precision of a s at the time 
when such a measurement will be performed, the strong 
coupling can alternatively be used as external input, as will 
also be discussed below. Earlier studies [ 26 1 have shown that 
in addition to the mass and the strong coupling constant, 
also the top quark width is accessible in a threshold mea- 
surement, in particular when using other observables such 
as the top quark momentum distribution and the forward- 
backward asymmetry in addition to the cross section. These 
additional aspects of tt threshold measurements are beyond 
the scope of the present paper. 

For the correct description of the cross section near the 
production threshold, the inclusion of high-order QCD con- 
tributions is necessary. Since no appropriate event generator 
is publicly available at present, the study follows the strat- 
egy of earlier studies performed for the TESLA collider [26 1 
by splitting the simulation study into two parts: the determi- 
nation of the event selection efficiency and background con- 
tamination, and the calculation of the top-pair production 
cross section in the threshold region. In this approach, the 
signal selection and background rejection is determined us- 
ing fully simulated top-pair signal events as well as relevant 
background channels at a nominal center of mass energy of 
352 GeV, slightly above the production threshold for the se- 
lected top mass of 174 GeV. For this, the full simulation, 
reconstruction and event selection procedure as described in 
Section[3]is followed. Data points along the threshold curve 
are then generated by taking the signal cross section deter- 
mined using NNLO calculations combined with the selec- 
tion efficiency, adding background events assuming a con- 
stant level over the considered energy range of 10 GeV as 
determined from the full simulations. In the following, more 
details are given on the individual steps. 

In the analysis, we consider a threshold scan with 10 
energy points spaced by 1 GeV from 344 GeV to 353 GeV, 
with an integrated luminosity of 10 fb _1 each, resulting in a 
total integrated luminosity of 100 fb . 
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345 350 355 

[GeV] 

Fig. 4 Top pair production cross section from theory calculations, with 
the luminosity spectrum (LS) of CLIC at 350 GeV and ISR as well as 
for all effects combined. 

6.1 The tt threshold at CLIC 

The top-pair signal cross section is determined using full 
NNLO calculations provided by the code TOPPIK Il8ll27ll. 
The top mass input is set to 174 GeV in the IS mass scheme 
0. The strong coupling constant a s is taken to be 0.118. 
Since TOPPIK provides the cross section in units of R, the 
ratio of o(e + e~ — > X) to o(e + e~ — > the appropri- 

ate conversion factor of the energy-dependent cross section 
e + e~ — > is applied in addition. 

Since this cross section is calculated for the energy at the 
e + e~ vertex, additional corrections for initial state radiation 
(ISR) and for the luminosity spectrum of the collider have 
to be applied. Initial state radiation is numerically folded 
into the cross section calculated by TOPPIK following the 
YFS (Yennie-Frautschi-Suura) solution as given in ll28l . In 
addition, the luminosity spectrum of CLIC operated at 350 
GeV, which is characterized by a main peak containing 77% 
of the full luminosity in the top 1 % of the energy and by a 
long tail to lower energies, is considered. Figure|4]illustrates 
the influence of these effects on the cross section. Both ISR 
and the luminosity spectrum result in a lowering of the cross 
section since part of the collision events are moved to ener- 
gies below the threshold. The tail to lower energies, but in 
particular also the beam energy spread in the main peak of 
the luminosity spectrum, result in a smearing of the cross 
section peak at threshold. 



6.2 Generation of data points 

The signal and background efficiencies are determined using 
fully simulated events as outlined in Section [3] The kine- 
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Fig. 5 Background-subtracted simulated cross section measurements 
for 10 fb -1 per data point, together with the cross section for the gen- 
erator mass of 174 GeV as well as for a shift in mass of ±200 MeV. 



matic fit and the likelihood-based background rejection are 
used to eliminate the majority of the non-ff background. 
Overall, a signal selection efficiency of 70.2%, including 
the branching fractions of the considered fully-hadronic and 
semi-leptonic top pair decays, is achieved. As for the 500 
GeV case, the dominant background channels are rejected 
at the 99.8% level, resulting in an effective cross section for 
the remaining background of 73 fb. 

Simulated data points are generated by taking the ISR 
and luminosity-spectrum corrected top pair cross section at 
the desired energy to calculate the nominal number of events 
expected. The simulated number of signal events is deter- 
mined on a random basis following a Gaussian distribution 
with the mean set to the nominal number of events and the 
standard deviation given by the square root of that number. 
With the same method, background events are added, us- 
ing a constant cross section of 73 fb as discussed above. 
It is assumed that the nominal background contribution is 
well known both from theory and from measurements be- 
low threshold, so the nominal number of background events 
is subtracted from the signal, leaving just the statistical vari- 
ations on top of the signal data with its own statistical un- 
certainty. 

Figure [5] shows the ten simulated data points for CLIC 
with an integrated luminosity of 10 fb -1 at each point. To 
illustrate the sensitivity of the data to the top quark mass, 
the threshold behavior for a shift in mass of ±200 MeV is 
also shown in the figure. 
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Fig. 6 Expected statistical errors from a simultaneous fit of the top 
mass and the strong coupling constant, showing the correlation of the 
two variables and the achieved precision. 

6.3 Measurement of the top mass and a s 

The IS mass of the top quark and the strong coupling con- 
stant are extracted simultaneously with a two-dimensional 
template fit. During the fitting procedure, the simulated data 
points are compared with calculated cross sections ("tem- 
plates") for a grid of different mass and strong coupling val- 
ues, generated with step sizes of 50 MeV and 0.0007 for 
m t and a s , respectively. The fit results are then given by the 
minimum of a two-dimensional parabolic fit to the % 2 dis- 
tribution of the different templates in the m t , a s plane. The 
expected statistical uncertainty of these parameters from a 
threshold scan is taken from the standard deviation of the 
measured mass in 5000 trials with different simulated data 
points. The results are illustrated in Figure [6] which shows 
the clear correlation between the two parameters, and also 
demonstrates that the fit itself does not introduce a bias on 
the results. 

At this stage of the analysis, the systematic error due 
to theory uncertainties is included, taken as an overall nor- 
malization uncertainty of the calculated cross section. Here, 
two levels are considered: A normalization uncertainty of 
3%, assumed as a reasonably conservative estimate of cur- 
rent theory uncertainties [29 1, and an uncertainty of 1% op- 
timistically assumed to be achievable with additional theo- 
retical work in time for experiments at linear colliders. 

The full results, including the theory uncertainty, are giv- 
en in Table [3] 

6.3.1 Alternative scenarios 

In addition to the two dimensional fit with 10 data points, 
other running and analysis scenarios are considered. When 



1 S top mass and a s combined 2D fit 



m, stat. error 


34 MeV 


m, theory syst. (l%/3%) 


5 MeV / 8 MeV 


<X S stat. error 


0.0009 


a s theory syst. (l%/3%) 


0.0008 / 0.0022 



Table 3 Results summary for the 2D simultaneous top mass and a s 
determination with a threshold scan at CLIC for 10 points with a total 
integrated luminosity of 100 fb -1 . 

taking the strong coupling constant as an external input to 
the fit, the statistical uncertainty on the top quark mass re- 
duces to 21 MeV. However, an additional systematic uncer- 
tainty of 20 MeV on the mass arises from the precision of 
a s , when assuming the current uncertainty of the world aver- 
age of 0.0007 [ 30 1 . In addition, the theory uncertainty of the 
overall normalization of the cross section is in that case fully 
absorbed by an uncertainty of the top mass, which amounts 
to 18 MeV and 56 MeV when assuming a 1% and a 3% 
uncertainty, respectively. Since the precision for the strong 
coupling constant achieved in the combined fit is compara- 
ble to the current uncertainty of the world average, there is 
no substantial benefit in using the strong coupling as exter- 
nal input unless the precision improves considerably by the 
time experiments are performed at linear colliders. 

Since the sensitivity to the top mass is mainly provided 
by the region in which the cross section rises steeply a mea- 
surement is also possible with a reduced data set consist- 
ing of just the first six points in the range from 344 GeV 
to 349 GeV, with a combined integrated luminosity of 60 
fb~ 1 . In this scenario, the statistical precision of the top mass 
is reduced by approximately 20% to 40 MeV, with slightly 
reduced theory uncertainties. The uncertainty of the strong 
coupling on the other hand increases by 45% to 0.0013, with 
unchanged theory systematics. This stronger increase in the 
error of the strong coupling constant compared to the top 
quark mass originates from the fact that the cross section 
above threshold, which is not measured in the six point scan, 
is particularly sensitive to the coupling constant. 

6.3.2 Additional systematic errors 

In addition to the theory normalization uncertainty other po- 
tential sources for systematic errors are studied. 

A potential dependence of the result on the choice of en- 
ergy values for the scan in relation to the top mass was ex- 
cluded by shifting the measurement points to higher energies 
by 0.5 GeV without a change in the determined mass and a s 
values. Thus, the precision expected from the LHC will be 
sufficient to determine the range of the threshold scan. 

The precise knowledge of the non-top background after 
event selection is crucial for the measurement of the sig- 
nal cross section. The effect of an imperfect non-top physics 
background description is studied by subtracting 5% and 
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10% too little or too much background before the fit. The 
5% variation results in a 18 MeV shift in the top mass and 
0.0007 in a s , corresponding to approximately two thirds of 
the statistical uncertainty on the top mass and close to the 
statistical uncertainty on a s . The 10% variation leads to a 
shift of twice the size for both values, but also significantly 
reduces the stability of the template fit. This shows that an 
understanding of the background contamination at the level 
of 5% or better is important to keep systematic effects well 
below the statistical uncertainties. One additional data point 
taken below the threshold will provide this required preci- 
sion. 

In addition to these analysis-related uncertainties, also 
machine-related uncertainties, such as the knowledge of the 
center-of-mass energy of the collider and the shape of the 
luminosity spectrum are relevant for this study. Previous ex- 
perience at LEP OTI and studies in the context of the ILC 
ll32l suggests that a precision of 10~ 4 on the center-of-mass 
energy is readily achievable given the high available inte- 
grated luminosity at each data point, resulting in systematics 
below the statistical errors of the top mass. The knowledge 
of the luminosity spectrum is very important for the correct 
description of the signal cross section, and thus also for the 
precision of the template fit. A full investigation has not yet 
been performed, but a simplified first study indicates that 
a 20% uncertainty of the RMS width of the main luminos- 
ity peak results in top mass uncertainties of approximately 
75 MeV, far in excess of the statistical uncertainties. Realis- 
tic studies of the expected uncertainties of the shape of the 
luminosity spectrum at CLIC are currently under way, but 
are beyond the scope of the present study. Overall, a good 
understanding of the luminosity spectrum of the accelerator 
is crucial to fully profit from the precision achievable in a 
threshold scan. 

Taking these systematic studies together, it is to be ex- 
pected that a threshold scan with 100 fb _1 to determine the 
top mass at CLIC will be systematics limited, resulting in a 
total uncertainty below 100 MeV. 

6.4 The impact of the luminosity spectrum: A threshold 
scan at ILC 

For the experiments, one of the most relevant differences 
between CLIC and the ILC are the differences in the lumi- 
nosity spectrum of the two machines. The influence of this 
difference is studied here by repeating the analysis using the 
ILC luminosity spectrum. The ILC spectrum is character- 
ized by a narrower main luminosity peak, and a slight in- 
crease of the fraction of the total luminosity available in the 
top 1% of the energy. As for the CLIC analysis, an inte- 
grated luminosity of 10 fb~ ! per point is assumed. Figure 
[7] shows simulated data points for a threshold scan at ILC. 
Compared to the CLIC case shown in Figure [5] the cross 
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Fig. 7 Background-subtracted simulated cross section measurements 
with the ILC luminosity spectrum for 10 fb -1 per data point, together 
with the cross section for the generator mass of 174 GeV as well as for 
a shift in mass of ±200 MeV. 



1 S top mass and a s combined 2D fit 



m t stat. error 


27 MeV 


m, theory syst. (l%/3%) 


5 MeV / 9 MeV 


<X S stat. error 


0.0008 


a s theory syst. (l%/3%) 


0.0007 / 0.0022 



Table 4 Results summary for the 2D simultaneous top mass and a s 
determination with a threshold scan at ILC for 10 points with a total 
integrated luminosity of 100 fb _1 . Event selection and background re- 
jection from CLIC_ILD is used. 



section rises faster due to the sharper main luminosity peak 
at the ILC. This faster rise of the cross section is expected to 
lead to somewhat reduced statistical uncertainties on the top 
mass for a given integrated luminosity due to increased dif- 
ferences between different mass hypotheses in the threshold 
region. 

For the generation of data points with the ILC luminosity 
spectrum, the signal selection efficiencies and the residual 
background contribution are determined with the CLIC_ILD 
detector concept. While there are some differences between 
this detector concept and the ones developed for the ILC, it 
is not expected that this will have a sizeable impact on the 
efficiencies in the present study. 

Figure [7] and Table [4] summarize the results of the com- 
bined extraction of the IS top mass and the strong coupling 
constant at ILC. As expected, the statistical uncertainties are 
reduced compared to a threshold scan at CLIC, with a 20% 
reduction of the uncertainty of the mass and a 10% reduc- 
tion of the uncertainty of a s . The theory systematics as well 
as other systematic uncertainties studied here are unchanged 
compared to those at CLIC. Thus, the difference in statisti- 
cal precision provided by the two different collider concepts 
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Fig. 8 Expected statistical errors from a simultaneous fit of the top 
mass and the strong coupling constant using the ILC luminosity spec- 
trum, showing the correlation of the two variables and the achieved 
precision. 

does not result in a significant difference of the overall pre- 
cision of the top mass measurement in a threshold scan. 



and comparable to or larger than the statistical error on the 
strong coupling. Additional systematic uncertainties from 
the beam energy, from the luminosity spectrum and from 
the background subtraction are comparable or smaller than 
the statistical uncertainty on the mass, resulting in a total 
uncertainty of the top mass below 100 MeV in a theoreti- 
cally well-defined mass scheme. The differences in luminos- 
ity spectrum between CLIC and ILC only lead to small dif- 
ferences in the statistical precision, far below the expected 
systematic uncertainties. 

In conclusion, these studies confirm the expectation that 
a linear e + e~ collider will be capable of measuring the mass 
of the top quark at the 100 MeV level, substantially beyond 
the precision expected at the LHC. The results demonstrate 
the benefits of the well-controlled luminosity spectra which 
allow the use of full energy constraints and shows the pos- 
sibilities provided by the high-resolution detector systems 
being developed for linear colliders. 
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7 Conclusions 

A linear e + e~ collider based on CLIC technology provides 
the capabilities for a precise measurement of the mass of 
the top quark both at and above threshold. We have stud- 
ied the expected precision obtainable in top pair production 
events with a scan around the threshold and with the direct 
reconstruction of the invariant mass of the top decay prod- 
ucts at an energy of 500 GeV, each assuming a total inte- 
grated luminosity of 100 fb . The studies have been per- 
formed with realistic GEANT4-based detector simulations 
including physics and machine-related backgrounds using 
full particle flow event reconstruction. 

Above threshold, the mass of the top quark, here de- 
fined as the invariant mass of the decay products, can be 
measured with a statistical precision of 80 MeV combining 
fully-hadronic and semi-leptonic top pair decays. System- 
atic uncertainties originating from the jet energy scale can 
be controlled to a similar level using the direct reconstruc- 
tion of the W bosons in the top pair decays and Z decays to 
bb from other sources. 

In a threshold scan, the top mass can be determined in a 
theoretically well defined way, here using the IS mass, with 
a statistical precision of 34 MeV together with the strong 
coupling constant, which is determined with a statistical un- 
certainty of 0.0009. The theory uncertainty, incorporated as 
an overall normalization uncertainty of the cross section, is 
substantially smaller than the statistical error on the mass, 
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